A Convex Formulation for Learning from Crowds

Abstract

Crowdsourcing services are now often used to collect large amounts of labeled data for machine learning, since they provide an easy way to get labels at very low cost and in a short period. The use of crowdsourcing has introduced a new challenge in machine learning, that is, coping with the variable quality of crowd-generated data. Although there have been many recent attempts to address the quality problem of multiple workers, only a few of the existing methods consider the problem of learning classifiers directly from such noisy data. All these methods modeled the true labels as latent variables, which resulted in non-convex optimization problems. In this paper, we propose a convex optimization formulation for learning from crowds without estimating the true labels by introducing personal models of the individual crowd workers. We also devise an efficient iterative method for solving the convex optimization problems by exploiting conditional independence structures in multiple classifiers. We evaluate the proposed method against three competing methods on synthetic data sets and a real crowdsourced data set and demonstrate that the proposed method outperforms the other three methods.
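
The abstract does not state the objective function, so the following Python sketch is only one plausible reading of it, not the authors' implementation. It assumes each worker j has a personal logistic-regression weight vector W[j], that all personal vectors are tied to a shared base vector w0 by a quadratic penalty, and that w0 is itself penalized toward zero; the names fit_personal_classifiers, lam, eta, and lr are hypothetical. Under these assumptions the joint objective is convex, the personal problems decouple once w0 is fixed, and w0 has a closed-form update once the personal vectors are fixed.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def fit_personal_classifiers(X, worker_labels, lam=1.0, eta=1.0,
                             outer_iters=20, inner_iters=50, lr=0.5):
    """Alternating minimization of an ASSUMED objective:
        sum_j logloss(y_j, X_j W[j]) + lam/2 * sum_j ||W[j] - w0||^2
        + eta/2 * ||w0||^2
    worker_labels: list of (row_indices, labels in {0, 1}), one per worker.
    """
    n_workers, n_features = len(worker_labels), X.shape[1]
    w0 = np.zeros(n_features)
    W = np.zeros((n_workers, n_features))
    for _ in range(outer_iters):
        # Step 1: with w0 fixed, each worker's subproblem is independent.
        for j, (idx, y) in enumerate(worker_labels):
            Xj = X[idx]
            for _ in range(inner_iters):
                grad = Xj.T @ (sigmoid(Xj @ W[j]) - y) + lam * (W[j] - w0)
                # Scale by the number of labels to keep the step size stable.
                W[j] -= lr * grad / len(idx)
        # Step 2: with all W[j] fixed, w0 has a closed-form minimizer.
        w0 = lam * W.sum(axis=0) / (eta + n_workers * lam)
    return w0, W

# Synthetic usage: three simulated workers with different label-flip rates.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))
y_true = (X @ np.array([2.0, -2.0]) > 0).astype(float)
worker_labels = []
for flip_rate in (0.1, 0.2, 0.3):
    flips = rng.random(200) < flip_rate
    worker_labels.append((np.arange(200), np.where(flips, 1.0 - y_true, y_true)))

w0, W = fit_personal_classifiers(X, worker_labels)
accuracy = np.mean((X @ w0 > 0) == (y_true > 0.5))
```

The base vector w0 is used for prediction here; no step estimates the true labels, which is the property the abstract emphasizes. Plain gradient descent is used for the personal subproblems purely for brevity.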

Similar resources

A Convex Formulation for Learning from Crowd

Recently crowdsourcing services are often used to collect a large amount of labeled data for machine learning, since they provide us an easy way to get labels at very low cost and in a short period. The use of crowdsourcing has introduced a new challenge in machine learning, that is, coping with the variable quality of crowd-generated data. Although there have been many recent attempts to addre...

Convex Formulation for Learning from Positive and Unlabeled Data

We discuss binary classification from only positive and unlabeled data (PU classification), which is conceivable in various real-world machine learning problems. Since unlabeled data consists of both positive and negative data, simply separating positive and unlabeled data yields a biased solution. Recently, it was shown that the bias can be canceled by using a particular non-convex loss such a...

Clustered Multi-Task Learning: A Convex Formulation

In multi-task learning several related tasks are considered simultaneously, with the hope that by an appropriate sharing of information across tasks, each task may benefit from the others. In the context of learning linear functions for supervised classification or regression, this can be achieved by including a priori information about the weight vectors associated with the tasks, and how they...

Active Learning from Crowds

Obtaining labels can be expensive or time-consuming, but unlabeled data is often abundant and easier to obtain. Most learning tasks can be made more efficient, in terms of labeling cost, by intelligently choosing specific unlabeled instances to be labeled by an oracle. The general problem of optimally choosing these instances is known as active learning. As it is usually set in the context of su...

Learning From Crowds

For many supervised learning tasks it may be infeasible (or very expensive) to obtain objective and reliable labels. Instead, we can collect subjective (possibly noisy) labels from multiple experts or annotators. In practice, there is a substantial amount of disagreement among the annotators, and hence it is of great practical interest to address conventional supervised learning problems in thi...

Journal

Journal title: Proceedings of the ... AAAI Conference on Artificial Intelligence

Year: 2021

ISSN: 2159-5399, 2374-3468

DOI: https://doi.org/10.1609/aaai.v26i1.8105